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By means of a simple example it is demonstrated that the task of finding and identifying certain 
patterns in an otherwise (macroscopically) unstructured picture (data set) can be accomplished 
efficiently by a quantum computer. Employing the powerful tool of the quantum Fourier transform 
the proposed quantum algorithm exhibits an exponential speed-up in comparison with its classical 
counterpart. The digital representation also results in a significantly higher accuracy than the 
method of optical filtering. 
PACS: 03.67.Lx, 03.67.-a, 42.30.Sy, 89.70.+C. 



I. INTRODUCTION 

Pattern recognition is one of the basic problems in ar- 
tificial intelligence, see e.g. [1]. For example, generally a 
short look at a picture like the one in Fig. 1 suffices for the 
human brain to spot the region with the pattern. How- 
ever, it is a rather non-trivial task to accomplish the same 
performance with a computer - in particular if the ori- 
entation and the structure of the pattern are not known 
a priori and it is not perfect (in contrast to the one in 
Fig. 1). 

Besides the detection and localisation of pattern (for 
example identifying seismic waves in the outputs of seis- 
mographs) the comparison and matching of the observed 
pattern to a set of templates (such as face recognition) is 
another interesting question. Usually these problems are 
solved with special classifiers, such as neuronal networks 
or Fourier analysis, etc., cf. [1]. 

The specific properties of the task of pattern recog- 
nition (one may consider many combinations simultane- 
ously and is interested in global features only) give raise 
to the hope that quantum algorithms may be advanta- 
geous in comparison with classical (local) computational 
methods (with a unique entry). 

During the last decade the topic of quantum infor- 
mation processing has attracted increasing interest, see 
e.g. [2] for a review. It has been shown that quan- 
tum algorithms can be enormously faster than the best 
(known) classical techniques: Shor's factoring algorithm 
[3], which exhibits an exponential speed-up relative to 
the best known classical method; Grover's search rou- 
tine [4] with a quadratic speed-up; and several black-box 
problems [5-8] , some of which also exhibit an exponential 
speed-up; etc. 

In the following a quantum algorithm for the detec- 
tion, identification, and localisation of certain patterns in 
an otherwise (macroscopically) unstructured data set is 
presented. It turns out that this method is exponentially 
faster than its classical counterpart too. Furthermore, it 
outperforms the (also extremely fast) method of optical 
filtering in terms of accuracy. 

The advantages of using quantum memories and com- 
puters for the aforementioned task of template match- 



ing (which is somewhat different from pattern detec- 
tion/localisation) have been discussed in [9,10]. Note, 
however, that the necessity of loading the complete data 
set into a quantum memory may represent a drawback 
(cf. the next Sections). In Ref. [11] an algorithm for data 
clustering (in pattern recognition problems) is developed, 
which is based on/inspired by principles of quantum me- 
chanics - but does not involve quantum computation. 




FIG. 1. A 32 x 20 array half-filled with points, i.e. £> = 1/2. 
In an 8 x 8 square they form a pattern (hence x ~ 1/10) j 
otherwise the points are randomly distributed. 

II. DESCRIPTION OF THE PROBLEM 

Let us consider a rectangular N x M array of 
unit cells which are either absorptive (black) or reflec- 
tive/transparent (white), cf. Fig. 1. In the following we 
shall assume that these cells are perfectly absorbing or 
reflecting/transmitting, respectively, but reasonable de- 
viations from this idealised behaviour resulting in finite 
absorption, reflection, and transmission coefficients do 
not alter the following considerations. 

The white cells are distributed with a roughly homoge- 
neous density g < 1 (for example g = 1/2) and will later 
be called points for brevity, i.e., the NxM array contains 
P = gNM points. A small fraction \ of these points (say 
X = 1/10) forms a pattern in a connected (but not nec- 
essarily rectangular) region of the size xNM, cf. Fig. 1. 

In contrast to Fig. 1 the pattern does not need to be 
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perfect - average symmetries are sufficient, see Section V 
below. For simplicity we restrict our consideration to 
linear (but again not necessarily rectangular) patterns. 
I.e., the patterns are supposed to be (approximately) 
invariant under at least two symmetry transformations 
described by global translations into different directions. 
Geometrically speaking, the angles within the pattern do 
not change, think of, for example, a set of parallel and 
equidistant lines or a periodic repetition of small elements 
as in some wall papers, etc. 

Let us suppose that we know N, M, and g, for these 
quantities can easily be measured having at hand the 
N x M array of unit cells (which might be some sensi- 
tive nano-device, for example) - but we neither know the 
size x °f the pattern (or whether there is a pattern at 
all) nor its structure and orientation (except that it is 
linear). The task is to find an algorithm for extracting 
this information. 

In principle, we may find out the position of all points 
(white cells) by shining appropriately focused light beams 
on the array and measuring the reflection or transmission. 
However, the array is also assumed to be very sensitive 
and each absorbed photon causes a certain amount of 
damage - similar to the Elitzur-Vaidman problem (detec- 
tion without destruction), cf. [12]. Therefore, the number 
of incident photons should be as small as possible. 

At the same time, we wish to obtain the characteris- 
tic parameters of the pattern with maximally attainable 
accuracy and minimal effort (i.e. number of subsequent 
operations) . In order to cast the above requirements into 
a well-defined complexity theoretic form we consider the 
limit of very large arrays N, M ^> 1. Furthermore, we as- 
sume N — 2" and M — 2 m with integers n, m g N allow- 
ing for a binary representation. (Otherwise we may en- 
large the array accordingly or consider a part of it only.) 



III. DATA PROCESSING 

It is probably most convenient to view the data set as 
a (quantum) black box 
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where the input state encodes the coordinates x and y 
of a potential point (a white cell) in the array as n- and 
m-qubit strings, respectively, together with a third one- 
qubit register |0) needed for unitarity. The output func- 
tion f(x,y) assumes the value 1 if there is a point at these 
coordinates and if not. 

As a possible physical realization one might imagine 
a configuration like the following: a focused light beam 
passes n + m controlled refractors (e.g. non-linear Kerr 
media) which change its direction by definite angles (fj if 
the control qubit is |1) and do not affect it otherwise. For 
suitably chosen angles (ip^ ,y = <Pq ,v jl? with j g N) the 



final direction of the beam encodes the position (x, y) on 
the N x M array if the digits of the coordinates \x), \y) 
are inserted as the control qubits. Shining the so di- 
rected light beam (which may consist of only one pho- 
ton) through an aperture mask like Fig. 1 on a detector 
reproduces the action of the black box in Eq. (1). 

Note that, in this realization, it is not necessary to 
load the complete (classical) information of the array into 
a quantum memory. This would slow down the whole 
process drastically and hence represent a serious caveat. 
(The same problem limits the region of applicability of 
Grover's quantum search procedure, for example.) In 
addition, it is hard to see how this could be done without 
shining a relatively large number of photons on the array 
and thereby destroying it, cf. Section II. 

Moreover, the involved number (n + m) of devices (re- 
fractors) is very small in this case. Each refractor acts 
roughly similar to a controlled swap or a switch gate 
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with a, f3 — 0,1; but the series connection of these re- 
fractors allows for a very efficient data processing. 



IV. QUANTUM ALGORITHM 

Now we may apply the well-known trick of inquiring 
all possible values of the coordinates (x, y) in only one 
run of the black box (quantum parallelism) . To this end 
we prepare a state as the superposition of all possibilities 
by using the Hadamard gate H. For a single qubit in 
the |0), |1) basis the (unitary) Hadamard gate H acts as 
H\0) = (|0) + |l))/\/2- By multiple application of H and 
running the black box (only once) we arrive at the desired 
superposition 
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Now measuring the third register |/) and obtaining 1 pre- 
pares the state as a superposition of the coordinates 
|a;) and \y) of all points. Assuming an ideal black box, 
the outcome / = would just imply the complementary 
set q — > 1 — g. However, in the presence of absorbing 
units as described in Section II, the resulting entangle- 
ment with the state of the absorptive cells would destroy 
the coherence completely in the case / ^ 1. 

It will be advantageous to reorganise the array by di- 
viding it into M rows of length N and combining them 
all to one string of length S = NM. The coordinate of 
a given point is now one n + m = s-digit binary number 
z = x + Ny (instead of two numbers x and y). The corre- 
sponding quantum state is simply given by \z) — \x)®\y). 
In terms of this representation the quantum state |\E') 
prepared by the measurement of / = 1 reads 
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where > z\ > S — 1 denotes position of Z-th point (as a 
s-digit binary number). 

The next basic part of the quantum algorithm is the 
application of the quantum Fourier transform (QFT). It 
acts on a basis element like \z) = 1 110100 - - - ) as 
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Hence the superposition state | in Eq. (4) will be trans- 
formed into 

k=o j=i b v6 V * J 

Assuming a distribution of points zi without any (macro- 
scopic) pattern (e.g. purely random) there will be no priv- 
ileged values of k (except k = 0) and the measurement 
of k > yields just noise. However, the presence of a 
pattern within the data set introduces a typical length 
scale and thus leads to peaks of the factor in front of 
\k) at certain values of k — which hence can be used as 
an indicator, see the next Section. In this way the QFT 
efficiently solves the problem of feature selection - i.e., 
extracting a small amount of relevant quantities (such as 
wave-numbers) from a large data set. 



V. PATTERN LOCALISATION 

The task of pattern recognition does not only include 
the mere detection of a pattern but also its localisation 
and classification. The comparison with a given set of 
templates will not be discussed here, see e.g. [9,10]. The 
next step is to extract informations about the pattern 
from the peaks in the measurements of k ~ in close anal- 
ogy to the reconstruction of the probe structure from the 
Laue diagram in diffraction experiments. 

Consider, for example, a simple pattern consisting of 
parallel lines like the one in Fig. 1. In this case the basic 
quantities are the distance D = const of the lines as well 
as their orientation as described by the (constant) angle 
— 7r/2 < $ < 7r/2. Here t? denotes the deviation of the 
line from a vertical one, i.e., after going down R rows the 
sequence is shifted by tant?i? columns to the right. So 
the points z marking the centre of a particular line are 
given by 



z = z + [N(N + tantf)] int cger ■ 
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Note that (in contrast to Fig. 1) the lines do not need 
to be perfect - it is sufficient if, in average, the density 
of points within a line- width of, say, D/2 deviates by a 
finite amount Ag (e.g. Ag =1/4) from the mean g. 



According to Eq. (6) every row of the pattern generates 
peaks at 
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with the second term denoting their width. Both, 
the position and the width of the peaks can be ob- 
tained from the associated Laue function £(£, k) = 
sin 2 (tt£k)/ sin 2 (ttk) with k = kcos$S/D and £ = 
0{N s /\) in this case. 

However, the sum of all rows interferes constructively 
only if k is fine-tuned according to 
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which again can be obtained from the associated Laue 
function with now n = k(N +t&n'&) / S and £ = 0{M s /x). 
The strongest peaks in the measurements of k occur for 
values which satisfy both conditions (8) and (9) simulta- 
neously. Accordingly, the wave-numbers of those poten- 
tial peaks read 
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where the corresponding width/uncertainty has been 
omitted. 

However, for large enough D, not every peak in Eq. (8) 
will contain wave-numbers matching Eq. (9) in general. 
The condition for this to happen is that the integer (N) 
in Eq. (8) multiplied by cost? N/D is again close to an- 
other integer - the one in Eq. (9) - within an accuracy 

of0(i/[zvxD- 

Therefore, not all the fc-values in Eq. (10) do necessar- 
ily represent large peaks - the first few of them may be 
suppressed. On the other hand, the larger D is the more 
fc-values of potential peaks in Eq. (10) are contained in 
the interval < k < S. Consequently, from the number 
D/ cost? of potential peaks in Eq. (10) there must be at 
least a few 0(l/^/x) which match both conditions (8) 
and (9). 

Determining the largest common factor of all the wave- 
numbers of the peaks (within the given accuracy) we ob- 
tain a value for the expression in Eq. (10) or integer mul- 
tiples of it. Unfortunately, this information alone is not 
sufficient for extracting D and #. To this end we may 
simply transpose the array (e.g. Fig. 1) by interchanging 
rows and columns N <-> M and run the same algorithm 
again. Since transposing corresponds to t? — > n/2 — t? the 
wave-numbers of the peaks are now 
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Combining the possible values for D/ cost? from Eq. (10) 
with the ones for D/sint? from Eq. (11) we obtain ap- 
proximate candidates for D and t?. Comparison with 
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the fine-tuning conditions such as Eqs. (8) and (9) as 
well as the knowledge of which peaks are suppressed and 
which are not allows us to extract the actual values of D 
and d with high (in fact, maximally attainable) precision 

o(i/Vx)- 

The height of the peaks can be estimated by means of 
Eq. (6). In the resonance case the sum includes S\Ag 
constructively interfering addends which lead to an am- 
plitude of order 0(x \fo)- Thus the probability p of 
measuring the peaks in Eqs. (10) and (11) is given by 

,_ (JsM), (12) 

i.e. independent of N and M - and therefore drastically 
enhanced over the (random) noise. 

Consequently if a number Q of measurements yields 
one or more pronounced peaks besides k = then there 
exists a pattern larger than Xmin — 0(1/ VQ) and oth- 
erwise not (at least with a very high probability). Quite 
reasonably, the smaller the pattern, i.e. x, the longer one 
has to search. 

After having solved the feature selection problem effi- 
ciently by the quantum algorithm the remaining analysis 
(peak finding and stopping criteria, etc.) of a small (inde- 
pendent of N and M) amount of measured wave-numbers 
can be accomplished by a classical algorithm. 

In this way one can determine the size of the pattern 
X by the frequency of measuring the peaks at k and k' 
(and their width). Its structure, i.e. the values of D and 
i?, can be inferred from the location of the peaks. 

Having found the parameters D, and x 01 the pat- 
tern it can be localised easily - for example by dividing 
the total N x M array into smaller pieces (according to 
X) and running the same quantum algorithm again in the 
smaller domains. 

More complicated (but still linear) patterns, such as a 
regularly recurring pictures* (like in many wall papers), 
possess more than one characteristic angles t? in general 
and therefore generate a richer peak structure - but the 
main idea remains the same. 



VI. COMPLEXITY ANALYSIS 

Let us estimate the size of the proposed algorithm, 
i.e. the number of involved computational steps, and 
compare it with the classical method, in the limit S — > oo 
while g, Ag, and x remain finite. 

In view of Eq. (12) we need only a few O(S ) queries 
of the black box in order to find a pattern of a given size 
with high probability. Clearly, this is not possible with 
any classical algorithm - demonstrating the advantage of 



* Again, these pictures do not need to be perfect - average 
features are sufficient. 



the global quantum computation over the local (only one 
point at the time) classical technique. Since the num- 
ber of queries of the black box corresponds to the total 
amount of photons shining on the array, the quantum 
algorithm causes less damage (cf. Section II) than any 
classical method, see also [12]. 

Given the explicit physical realization of the black box 
described in Section III it is also possible to compare 
the total number of fundamental manipulations. For the 
preparation of the initial state in Eq. (3) one has to apply 
the Hadamard gate m + n = s = log 2 S times. The black 
box itself involves about the same number of operations. 
The quantum Fourier transform (QFT) in Eq. (5) re- 
quires 0(log 2 S) steps for obtaining the exact result and 
is even faster 0(log 2 S) if we measure [2] the outcome 
immediately afterwards - as it is the case here. 

In contrast, the best known classical algorithm, the 
fast Fourier transform (FFT), implements 0(S log 2 S) 
operations and is therefore exponentially slower. Note 
that, since we do not know the typical "wave-numbers" 
k associated with the pattern a priori, we would have 
to calculate the FFT for a large number O(S) of possi- 
ble values of k - whereas the QFT accomplishes all this 
simultaneously and automatically gives us the values k 
with the largest amplitudes in average measurements. 

Unfortunately, it cannot be excluded here that perhaps 
a classical algorithm exists which is better than the FFT 
and may compete with the proposed quantum algorithm 
(though not in the number of queries of the black box). 
But since the processing of the coordinates of only one 
single point already requires 0(\og 2 S) operations, one 
would have to find the pattern by considering a few O(S ) 
points in order to outrun the quantum computer - which 
is apparently not possible. 

Nevertheless, in certain situations - e.g. for perfect 
lines with D = 0(1) such as in Fig. 1 - it is possible 
to design an appropriate classical algorithm which de- 
termines $ by using only 0(log 2 S) points with q > 1, 
which requires 0(log 2 +1 S) computational steps. In this 
case the speed-up is merely polynomial. 

In the more general case, however, where D can be 
very large D » 1 and the lines are not perfect, it is 
really hard to see how one might extract basically the 
same information as the FFT in (polynomially) logarith- 
mic time classically. (An 0(V~S) algorithm, for example, 
would also be exponentially slower than the QFT.) 

Assuming that there is indeed no such classical algo- 
rithm (or, that the set of patterns for which the Fourier 
transform is the best classifier is not empty) the problem 
under consideration represents another example for the 
(conjectured) exponential speed-up of quantum compu- 
tation - based on the power of QFT for problems related 
to (quasi) periodical structures (which is also the basic 
ingredient for Shor's algorithm [3]; though in that case 
the periodicity is exact - in contrast to the situation con- 
sidered here). 

Of course, this speed-up has only been possible since 
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it was not necessary to load the complete array into a 
quantum memory (cf. [9,10]) - this would have involved 
about O(S) operations and thereby lead to a drastic (ex- 
ponential) slow-down. 

VII. SUMMARY AND OUTLOOK 

In summary, quantum algorithms are capable of solv- 
ing certain problems of pattern recognition (i.e. detec- 
tion, localisation, and classification) besides template 
matching [9,10] much faster 0(log 2 S) than their clas- 
sical counterparts [13]. 

Although this has been demonstrated explicitly in this 
article for line patterns only, the basic idea applies to 
more difficult (but still linear) patterns as well. (In some 
sense, this idea arises from the often successful approach 
to copy the elegant solutions which nature reveals to us - 
such as the possibility of distinguishing a crystal from an 
amorphous material via X-ray diffraction or the method 
of optical filtering, see the next Section.) The investi- 
gation of non-linear patterns, such as a set of concentric 
circles, is apparently more involved and probably requires 
adapted methods. 

Together with the findings in Refs. [9,10] the results 
of the present article give raise to the hope that quan- 
tum algorithms are also advantageous for more general 
pattern recognition problems. 

VIII. OPTICAL FILTERING 

Interestingly, the manipulations involved in the pro- 
posed quantum algorithm - apart from the physical re- 
alization of the black box itself - can be accomplished 
(at least in principle) with present-day optical devices: 
Hadamard gates as well as the calculation of the quan- 
tum Fourier transform with subsequently measuring the 
outcome can be realized using beam splitters and classi- 
cally controlled phase shifters, cf. [2]. This observation 
leads to the question of whether one could achieve a sim- 
ilar performance with purely optical techniques. 

Indeed, the method of optical filtering reproduces some 
key features of the proposed quantum algorithm. Shin- 
ing a plane wave with an appropriate wave-number on 
an aperture mask like Fig. 1 the far field diffraction am- 
plitudes are given by the Fourier transform of the object 
(e.g. Fig. 1) in terms of the perpendicular wave-number. 
Using an ordinary convex lens one may convert these 
wave-numbers into positions in the focal plane of the lens, 
see e.g. [14]. 

In this way the described apparatus effectively calcu- 
lates the desired Fourier transform. However, achieving 
the same accuracy as the proposed quantum algorithm, 
i.e. 0(log 2 S) digits for $ and D, would require expo- 
nential precision and hence exponential effort: number 
of detectors in the focal plane, etc. Another problem is 



the fact that D is not known a priori and may vary over 
several orders of magnitude - which makes it difficult to 
select a suitable wave- number for the incident light. 

These obstacles are caused by the main difference be- 
tween optical filtering and the proposed quantum algo- 
rithm. In optical filtering, the relevant quantities (such 
as wave-number, position in the focal plane, and D) 
are directly, i.e. linearly, related to each other whereas 
the quantum algorithm employs the digital representa- 
tion and therewith outperforms the former method. 

On the other hand, if we happen to know the order of 
magnitude of D in advance and would be willing to settle 
for a limited accuracy of only a few O(S ) digits, we may 
detect the pattern with optical filtering by using only a 
few O(S ) photons. Although much less powerful, i.e. ac- 
curate, this method would be significantly faster than the 
proposed quantum algorithm, which also uses only a few 
photons shining on the pattern - but it requires C(log 2 S) 
photons (or, more generally, qubits) for the subsequent 
data analysis (QFT, etc.). (This enhancement in speed - 
although at a certain cost - reflects the fact that a quan- 
tum field theoretical object such as a photon has more 
degrees of freedom than just one single qubit.) 

IX. APPENDIX 

If there were no absorptive cells in the array at all 
but only (perfectly) reflective (white) and transparent 
(black) ones, one could (in principle) realize the following 
generalisation of the black box in Eq. (1) 
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with a = 0, 1. Here ® denotes summation modulo 2, i.e., 
1©0 = 0©1 = 1 and 0®0=lffil = 0. In this case one 
may improve the quantum algorithm by sending the su- 
perposition state in Eq. (3) with the third register being 
(|0) - |1))/V2 instead of |0) to the black box. The third 
register does not change during this procedure and the 
resulting state encodes the information about the points 
in the array in the phases (+1 or —1) instead of the am- 
plitudes (1 or 0) 

i*> = ^=E(-i) /(z) i*>- ( 14 ) 

Assuming g = 1/2, the quantum Fourier transform of 
this state has certain advantages over the one in Eq. (4) 
- one gets rid of the (useless) peak at k = and enhances 
the probabilities of the other peaks by a factor of 2. Un- 
fortunately, it is hard to see how one might be able to 
exploit this advantage of the modified black box in the 
presence of absorbing units. 
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